Which Robust Versions of Sample Variance and Sample Covariance Are Most Appropriate for Econometrics: Symmetry-Based Analysis

نویسندگان

  • Songsak Sriboonchitta
  • Ildar Batyrshin
  • Vladik Kreinovich
چکیده

In many practical situations, we do not know the shape of the corresponding probability distributions and therefore, we need to use robust statistical techniques, i.e., techniques that are applicable to all possible distributions. Empirically, it turns out the the most efficient robust version of sample variance is the average value of the p-th powers of the deviations |xi − â| from the (estimated) mean â. In this paper, we use natural symmetries to provide a theoretical explanation for this empirical success, and to show how this optimal robust version of sample variance can be naturally extended to a robust version of sample covariance. 1 Formulation of the Problem Need to determine a parameter: traditional case. Often, we observe a sample of several instances x1, . . . , xn of a random variable X. In many practical situations, we know that the random variable X has a distribution with the probability density function (pdf) ρ(x) = ρ0(x−a), where ρ0(x) is a known function, and a is an unknown parameter. For example, X may be the measurement result, which can be represented as X = a+X0, where

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Which Robust Versions of Variance and Covariance Are Most Appropriate for Econometrics: Symmetry-Based Analysis

In many practical situations, we do not know the shape of the corresponding probability distributions and therefore, we need to use robust statistical techniques, i.e., techniques that are applicable to all possible distributions. Empirically, it turns out the the most efficient robust version of variance is the average value of the p-th powers of the deviations |xi − â| from the (estimated) me...

متن کامل

Conservative Sample Size Determination for Repeated Measures Analysis of Covariance.

In the design of a randomized clinical trial with one pre and multiple post randomized assessments of the outcome variable, one needs to account for the repeated measures in determining the appropriate sample size. Unfortunately, one seldom has a good estimate of the variance of the outcome measure, let alone the correlations among the measurements over time. We show how sample sizes can be cal...

متن کامل

Limiting Form of the Sample Covariance Eigenspectrum in PCA and Kernel PCA

We derive the limiting form of the eigenvalue spectrum for sample covariance matrices produced from non-isotropic data. For the analysis of standard PCA we study the case where the data has increased variance along a small number of symmetry-breaking directions. The spectrum depends on the strength of the symmetry-breaking signals and on a parameter α which is the ratio of sample size to data d...

متن کامل

Bayesian Sample size Determination for Longitudinal Studies with Continuous Response using Marginal Models

Introduction Longitudinal study designs are common in a lot of scientific researches, especially in medical, social and economic sciences. The reason is that longitudinal studies allow researchers to measure changes of each individual over time and often have higher statistical power than cross-sectional studies. Choosing an appropriate sample size is a crucial step in a successful study. A st...

متن کامل

Power Spectrum Super-Sample Covariance

We provide a simple, unified approach to describing the impact of super-sample covariance on power spectrum estimation in a finite-volume survey. For a wide range of survey volumes, the sample variance that arises from modes that are larger than the survey dominates the covariance of power spectrum estimators for modes much smaller than the survey. The perturbative and deeply nonlinear versions...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016